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The one-shot success probability of a noisy classical channel for transmitting one classical bit is the 
optimal probability with which the bit can be sent via a single use of the channel. Prevedel et al. 
(PRL 106, 110505 (2011)) recently showed that for a specific channel, this quantity can be increased 
if the parties using the channel share an entangled quantum state. We completely characterize the 
optimal entanglement-assisted protocols in terms of the radius of a set of operators associated with 
the channel. This characterization can be used to construct optimal entanglement-assisted protocols 
from the given classical channel and to prove the limit of such protocols. As an example, we show 
that the Prevedel et al. protocol is optimal for two-qubit entanglement. We also prove some simple 
upper bounds on the improvement that can be obtained from quantum and no-signaling correlations. 



Suppose that two parties, Alice and Bob, communicate 
over a noisy classical channel. While there are many ex- 
amples of how Alice and Bob may benefit when they up- 
grade to a quantum channel, examples in which shared 
entanglement improves communication over a classical 
channel have only recently been discovered 043 ■ That 
these examples exist at all is somewhat surprising, as nei- 
ther shared entanglement Q nor the assistance of non- 
signaling correlations 0] can increase the classical capac- 
ity of the channel. So far, work in this direction has 
focused on the the (one-shot) zero error capacity, which 
measures the number of messages Alice can send to Bob 
perfectly [H, 0-0] , and the related notion of the one-shot 
success probability 0], which is the best probability with 
which Alice can successfully send a single bit to Bob. 

It is of interest to determine how shared entanglement 
affects these two quantities, as this will further our un- 
derstanding of how resources from quantum mechanics 
can be used for communication. 

Previous work on enhanced communication over a clas- 
sical channel has focused on the assistance that can 
be provided by non-signaling correlations. In this set- 
ting, both the zero error capacity and one-shot suc- 
cess probability can be written as the solution to lin- 
ear programs 0, 0. Some upper bounds are known 
for the entanglement assisted zero error capacity [f|; 
these bounds are often the best bounds available in the 
unassisted case, suggesting that there are strong limi- 
tations to the amount of assistance that entanglement 
can provide. Much less is known about the limits of 
quantum assistance for the one-shot success probabil- 
ity. In . Prevedel et al. give an example of a channel 
where the unassisted success probability, Succ(A), the 
entanglement-assisted success probability Succq (N) , and 
the non-signaling assisted success probability Succns 
are all different. It is known that entanglement cannot 
be completely helpful: if Succ(iV) is less than one, then 
so is Succq(JV) 0. However, the size of the gap between 
them has remained unquantified. 



We use two distinct approaches to quantify the extent 
to which entanglement can help Alice and Bob. In our 
first approach, we derive a simple formula for Succq(A) 
in terms of the dimension of the entanglement. This for- 
mula, which is given by maximizing a quantity over a 
family of positive semidefinite operators, is easy to work 
with, and as an example of its applicability, we show that 
the protocol from Q is in fact optimal for their channel 
and for 2-dimcnsional entanglement assistance. 

While our first approach is quite general, it does not 
give a closed form for the success probability. Our second 
approach obtains explicit closed-form upper bounds for 
the success probability. As a first step, we prove the 
following general bound on non-signaling assistance. Let 
r be the number of elements in the input alphabet of N. 
Then, 

Succ N s(A) - i 2 

^ L 2 < 2 (i) 

Succ(TV) -\ ~ r w 

The quantity (Succ(A) — |) measures the advantage 
that Alice and Bob have over a random strategy; thus, 
(HJ) measures the additional advantage gained by non- 
signaling correlations. Our proof of ([TJ uses the linear 
program characterization of Succns(-^0 from [f|. From 
this, we derive an upper bound on the amount of assis- 
tance from a binary quantum device; we use the fact that 
any quantum correlation can be decomposed into a prob- 
abilistic mixture of a local correlation and a non-signaling 
correlation (the concept of local fraction) . We show that 
both of these bounds are the best possible, in the sense 
that there are channels for which equality is achieved. 

A common thread in both approaches above is the use 
of the radius of a subset of a normcd vector space. Our 
formula for Succq(jV) depends on maximizing the ra- 
dius of a family of Hcrmitian operators. In the second 
approach we use a formula for Succns(-'V) ( an alternate 
formulation of Proposition 14 from |5| ) which is expressed 
in terms of the radius of a particular set of vectors. 
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Notation and terminology. Throughout this paper, 
we assume that Alice is trying to transmit a single bit 
to Bob across a classical channel. Alice and Bob will 
have access to a two-part input output device D (Figure 
[T]), which may be classical, quantum, or implement an 
arbitrary non-signaling correlation. Each two-part input 



of {Hi} is equal to 



Alice 



Da 



Bob 



FIG. 1. A two-part input output device. 

output device D gives rise to a correlation between Alice 
and Bob, given by 

{D(pq\rs) | p G V, q G Q, r G K, s G S} 

so that D(pq\rs) is the probability of outputs p and q 
given inputs r and s. We will abuse notation by identi- 
fying the device D with the correlation it induces. 

We say a device is non-signaling if the partial sums 
J2 P £V D(pq\rs) do not depend on r, and the partial 
sums X] 9 efi> D(pq\rs) do not depend on s. We say a 
non-signaling device D is quantum if there exist Hib- 
crt spaces Va and Vb, families of POVMs {{A£} p } r 
{{B1} q } s , and a density operator A on Va <£> Vb such 
that D(rs\pq) = Tr((^ <g> Bj)A). The device is quantum 
with dimension n if both Va and Vb are n-dimensional, 
and binary if the input and output alphabets have size 2. 

A classical channel N is given by a matrix of condi- 
tional probabilities {N(y\x) \ x G X,y G V}, where 
N(y\x) is the probability of seeing an output y G y given 
the input x € X . For any channel N, let Succ(A) denote 
the maximum probability with which a single bit can be 
sent across N (without assistance). Let Succ(A^, D) de- 
note the maximum probability for a single-bit transmis- 
sion across N with the assistance of D. If S is a set of 
two-part devices, write Succs(A) := sup SgS Succ(A", S). 
We will be concerned with three choices of S. We con- 
sider the set NS of non-signaling devices; the sets Q and 
Q(n) of quantum and n-dimensional quantum devices; 
and the set Q b of binary quantum devices. 

General quantum devices. In this section, we derive a 
formula for Succq(„)(7V), and give an example of how 
to use our formula. We will use the radius of a fi- 
nite set {Hi\i(zz of Hermitian operators on a finite- 
dimensional Hilbert space V, defined by Rad{i/i} 2 ; := 
mine maxi \\Hi — C\\ where the minimum is taken over 
all Hermitian operators C on V . The following lemma, 
which is proved in section 1 of the supplementary mate- 
rial, gives an alternative expression for the radius. 

Lemma 1. For any finite set {H{\i^x of Hermitian oper- 
ators on a finite- dimensional Hilbert space V , the radius 



max 

A«>0,A<>0 
Tr(£A0=l/2 



Here, the maximization is over all Hermitian opera- 
tors {Aijigi and {A^jigi on V satisfying the given con- 
straints. 

Using this lemma, we will prove the following theorem, 
which characterizes SuccQ( n ) (N). 

Theorem 2. For any channel N , and any integer n > 2. 
Succ Q(n) (N) = \ + max Rad \ £ N(y\x)B y \ 

{y} \ [y^y J. g J 

where the maximization is over all families {B y } ye y of 
Hermitian operators on <C n satisfying < B y < I. 

Proof. Consider the following quantum-assisted protocol 
for transmitting a single bit across N. Alice and Bob pos- 
sess a bipartite quantum system represented by a density 
matrix A on a Hilbert space Va®Vb- Alice wishes to 
transmit a message a G {0, 1}. Depending the value of 
a, she applies one of two possible POVMs {Ag} xe ^ or 
{Af }xtx to Va and sends the result of the measurement 
to the channel N. Bob receives the output y of the chan- 
nel, and according to this output, applies one of a family 
of binary POVMs B$}} to V B . The result of 

this output is Bob's guess at Alice's original message. 

In order to compute the success probability for this 
protocol, it is not necessary to know the state A or the 
operators {A^\ x ^ a : it is only necessary to know the oper- 
ators p x a := Tia \{A% <E> I) A] , which represent the state of 
Bob's quantum system when the outcome of Alice's mea- 
surement is x. These operators satisfy J2 X Po = J2x Pi 
and Tt(pq) = 1, and, in fact, any family of operators sat- 
isfying those two constraints can be induced by an appro- 
priately chosen state A and appropriately chosen POVMs 
{Agj-Eg^t or {Af} xe ^. Thus, for our purposes, to spec- 
ify an (n, n)-dimensional entanglement-assisted strategy 
for communicating a single bit across N, it suffices to 
specify a collection of binary POVMs {{By,Bl}} 
on C" and a collection of positive semidefinite operators 
{pl\a£ {0, 1}, x G X} on C" satisfying 



Pi 



and 



Tr£>g) = l. (2) 

XX X 

The success probability of the protocol is given by 
\ I^A^^Tr^) 



\yey xex J 
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Let B y = B°. Since B\ 



I — By, the expression above 



simplifies to 



Tr 



(3) 



Proposition 4. Let N be a classical channel, and for 
each x g X, let n x = {N(y | x)} y ^y G R^. Then, 

Succ NS (iV) = - + --Radi{ns \x£ X} . (4) 

Note that in the above formula, we take the radius of 
{n x } as a subset of , not subset of the set of 

probability distributions on y. 



The quantity SuccQ( ra ) (N) is the maximum of this ex- 
pression over all n x n Hermitian operators {By\ y ^y sat- 
isfying < By < I and all n x n positive semidefmite Proof of Proposition^ By Proposition 14 from 



operators {pl} xe x,ae{o,i} satisfying ^ above. Apply- 
ing Lemma [T] yields the desired formula. □ 

A convexity argument (see section 2 of the supplemen- 
tary information) proves the following stronger version of 
Theorem H 

Corollary 3. The formula in Theorem^ holds also when 
the maximum is taken only over families {B y } that con- 
sist of projections on C n . 

As an example of the utility of Corollary [3l consider 
the channel M in Figure [2j which is defined in 
The input alphabet for M is {1,2,3,4}, and the out- 
put alphabet is {1,2,3,4,5,6}. In section 3 of the sup- 
plementary information, we prove that for any 2x2 
projection operators Px,...,P6, the radius of the set 
{j2l =1 M(y\x)P y | x = 1,2,3,4} is no more than ± + 
This maximum is achieved when Pi =0, P2 =1, 
and {p3,P4} and {P5,Pe} are two different Pauli mea- 
surements. Therefore, 



Succq( 2 ) (M) 



3V2' 



and the protocol from Q is optimal for 2-dimensional 
entanglement assistance. (We note that this generalizes 
the paper Q, which showed the optimality of within 
a more restricted class of protocols.) 
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FIG. 2. The channel M, from @]. 

Non-signaling devices. In order to prove more explicit 
bounds on the limits of quantum assistance, we first turn 
our attention to assistance by a non-signaling correlation. 
The next proposition asserts a formula for the optimal 
non-signaling assisted success probability of a channel. 
For any finite set of vectors S C R fe , let Radi(S') denote 
the radius of S under the 1-norm. 



Succns(-^) = 1 ~ max min (min{c y , N(y | x)} — c y /2) 

= min max > (1 — min{c„, N(y I x)} + c v /2) 
ceRy xex ^ y yl 

vey 

Using the easily proved fact that \\u — v\\ ± = J2i u i + 
Vi — 2 '^2 i min{ui, Vi}, the above formula simplifies to 

1 1 



□ 



Succns(^V) = mm max — | — - lie — n x \\, , 
which implies ([4]) above. 



Formula (J4j) allows us to relate the quantity Succns (N) 
to the quantity Succ(A r ). 



Theorem 5. Let N be a classical channel, and let r 
\X\ denote the size of the input alphabet of N . Then, 



1 / 2 

Succ NS (A0 - - < 2- - 

2 \ r 



Succ(Af) - 



1 



(5) 



Proof. Let {n x } be the vectors defined in Proposition 2] 
The unassisted one-shot success probability can be ex- 
pressed in terms of these vectors like so: 

Succ(TV) = - + - • Dia mi {n x }, (6) 

where Diami denotes diameter under the 1-norm. A 
triangle-inequality argument shows that the distance 
from the mean vector (^2 x n x ) jr to the set {n x } can- 
not exceed (l — -) Diami{n, x }. Therefore, Radi{n x } < 
(l — -) Diam!}?!^,}, which implies the desired result. □ 

Theorem [5] is the best possible in the sense that there 
are channels where equality is achieved in ([5]). Consider 
the following example, which is a generalization of the 
channel M from Figure^ Let s be a positive integer. For 
any i G {0, 1, 2, . . . , 2 s — 1}, let bi € F| denote the binary 
representation of i. Define a channel T as follows. The 
input alphabet of T is {0, 1,2, ... , 2 s — 1}, and the output 
alphabet is {1, 2, . . . , 2 s — l}x{0, 1}. On given input i, the 
channel chooses an element j G {1, . . . , 2 s — 1} uniformly 
at random and outputs the pair (j, bi ■ bj) (where bi ■ bj 
denotes the inner product of bi and bj mod 2). 

For any i € {0, 1, 2, . . . , 2 s - 1}, let l t g M 2( - r ~ 1 '> denote 
the probability vector which expresses the output of T 
on input i. It is easy to see that the diameter of {£i} is 
2 S /(2 S - 1), and thus Succ(T) = i + 2 s - 2 /(2 s - 1). On 
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the other hand, the radius of {£i} is 1, as can be seen 
from the following calculation. For any c € R 2< - 2 



2"-l 



max 

o<;<2 s - 



2~ s [2 s - 1 |^-(2 s -ir 1 |+2 s - 1 |c Jt -0|] 



1<j"<2 s -1 

te{o,i} 



>2- s J2 [2 S_1 (2 S - I)" 1 ] = 1. 



l<j<2 -1 

te{o,i} 

Thus Succns(T) — 1- (And, indeed, a perfect communi- 
cation protocol for T exists-see section 4 of the supple- 
mentary information.) The channel T achieves equality 
in <[5j) . 

The following modifiied version of Theorem [5] will be 
useful in our analysis of entanglement assistance. 

Theorem 6. Let N be a classical channel, and let D be a 
non-signaling correlation arising from a two-part device 
(Da, Db). Let m denote the size of the output alphabet 
of Da- Then, 



Succ(A^)-i<(2-l 



Succ(A) - 



1 



(7) 



Proof. A protocol for communicating a single bit a using 
N and D proceeds as follows. Alice uses a to choose 
an input to Da, and then uses a and the output of Da 
to choose an input to N. Bob uses the output of N to 
choose an input to Db, and then uses the outputs of N 
and Db together to guess the bit a. 

The optimal success probability Succ(iV, D) can be 
achieved by a deterministic protocol (i.e., a protocol in 
which Alice and Bob make their choices according to de- 
terministic functions). As there are only 2m possible in- 
puts that Alice could make to A in a deterministic proto- 
col, the success probability of such a protocol is bounded 
by (2 - 2/(2m))Succ(A) by TheoremE □ 

Binary quantum devices. Finally, we will use our 
bounds for non-signaling devices to obtain bounds for 
assistance by binary quantum devices. 

A two-part device D is local- deterministic if the output 
of each part is a deterministic function of its input. A 
non-signaling correlation is local if it is a convex combi- 
nation of local-deterministic correlations. We define the 
local fraction of a non-signaling correlation, a concept 
which is used in [§], (loj . 

Definition 7. Let D be a non-signaling correlation. The 
local fraction of D, denoted \oc(D), is the largest real 
number a € [0, 1] such that there exists a decomposition 



D = aL + (\- a)F, 



(8) 



where L is a local correlation and F is a non-signaling 
correlation. 



For any classical channel N, it is easy to see that when 
a decomposition such as ([8]) exists with L local and F 
non-signaling, 

Succ(A, D) < aSucc(A, L) + (1 - a)Succ(A, F) 
< aSucc(A) + (1 - a)Succ NS (A). 

This implies the following stronger version of Theorem [6] 

Theorem 8. Let N be a channel, and let D be a 
non-signaling correlation arising from a two-part device 
(Da, Db). Let m denote the size of the output alphabet 
of Da ■ Then 



Succ(A, D) 



Succ(A) - 



< 1 



1 



1 



(l-loc(D)). □ 



Thus, to obtain improved upper bounds on Succ(A, D) 
for quantum correlations D, it suffices to find lower 
bounds on the local fractions of quantum correlations. 
In section 5 of the supplementary material, we use facts 
about the geometry of quantum and non-signaling corre- 
lations ll| to prove the following bound for binary quan- 
tum correlations. 

Proposition 9. Let D be a binary quantum correlation. 
Then \oc(D) > 2 - a/2. 

Combining Theorem [8] and Proposition [9] yields the 
following. 

Corollary 10. For any classical channel N , 



Succ Qb (A)-i 1 
~ 2 



Succ(A) 



1 

71' 



Note that equality occurs in Corollary |TD] for the case 
discussed in [2J. 

Conclusion. We have given a formula for the 
n-dimcnsional entanglement-assisted one-shot success 
probability of a classical channel, and have shown its 
utility by using it to show that the protocol in [ij is 
optimal. We derived a more explicit bound on the ad- 
vantage gained by binary quantum correlations (which is 
an equality in the case of |2J). Along the way, we estab- 
lished a bound on the advantage gained by non-signaling 
assistance and provided an example where equality is 
achieved. 

Future research could explore methods for evaluating 
the formula from Theorem [2] (Section 3 of the supple- 
mentary information provides methods which might gen- 
eralize.) Also, it would be interesting to try to prove 
stronger bounds on the increase in Succ(iV) that is pro- 
vided by entanglement. (This might involve generaliza- 
tions of Proposition^) Another natural next step would 
be to consider the one-shot success probability for non- 
binary messages. 
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SUPPLEMENTARY INFORMATION 



1. The radius of a set of Hermitian operators 

Lemma 1. For any finite set {Hi} ie x of Hermitian operators on a finite- dimensional 
Hilbert space V , the radius of {Hi} is equal to 

max 

A;>0,A->0 
Tr(EA,) = l/2 

Proof. Any family of Hermitian operators {Hi} may be translated to a family 
{Hi + W} which contains the operator 0. This translation does not affect the 
radius nor the expression from the statement of the lemma. Therefore, we may 
assume that {Hi} contains 0. By definition, 

(1) Ra,d{Hi}i = min (r), 

C,r 
C-Hi>-rl 
Hi-C>-rl 

where the maximization is over Hermitian operators C and real numbers r. Since 
G {Hi}, whenever the constraints in this maximization are satisfied we have in 
particular that C > — rl. Letting Z = C + rl, we obtain the following alternate 
expression: 

(2) Rad{Hi}i = min (r). 

Z,r 
Z>Hi 
-Z+2rK>-Hi 

By semidefinite programming duality, this is equivalent to 

Rad{Hi}i = max [(V T^Ai-Hi) - VTr^Jf;)) . 

2Tr(EV 4 )<l 

It is easy to see that this maximum is achieved by a pair of families {Ai},{A^} 
satisfying E A « = E K and 2Tr (E K) = L D 

2. The proof of Corollary 3 in the main text 

The radius function is convex in the following sense: for any familes of operators 
{Jy} y ey and {K y } ye y, and real number a € [0, 1], 

Rad{a J y + (1 — a)K y } y 

< aRad{ J y } y + (1 — a)R&d{K y } y . 

(For, if we let J' be such that the distance from J' to {J y } y is equal to r := 
RadjJyjy, and we let K' be such that the distance from K' to {K y } y is equal to 
r' := RadjA'ylj,, then the distance from aJ' + (1 — a)K' to {aJ y + (1 — a)K y } y 
is no more than ar + (1 — a)r' by the triangle inequality.) In particular, this 



2jTr((A l - X'i)Hi) 
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convexity property implies that the radius of {aJ y + (1 — a)K y } y is no more than 
the maximum of Rad{ J y } y and R&d{K y } y . 

Since any Hermitian operator B satisfying < B < I is a convex combination 
of projection operators, Corollary 3 follows from Theorem 2. 

3. An example calculation 

Let M be the channel defined in figure 2 in the main text. In this section we 
will use Theorem 2 from the main text to calculate the quantity Succq( 2 )(-^0- 
First, we will prove the following lemma which provides a simplified formula for 
Succq(„) (M). For any projection operator P, let P x denote projection onto the 
orthogonal complement of P. 

Lemma 2. For any n > \, the quantity Succq(„)(M) is equal to 

-+ (-) max (Rad{X + Y + Z,X + Y X + Z X ,X X +Y + Z X ,X X +Y X + Z}) 
2 V 3 J x,y 7 z 

where the maximum is taken over all projection operators X, Y, Z on C n . 
Proof. For any Hermitian operators Bi, B 2 , B 3 , B4, B$, Bq on C™, let 

F(B 1 ,B2,B 3 , B4, B 5 , Bq) 

be equal to the quantity 

Rad {Bi +B 3 + B 5 , B x + B 4 + B 6 , B 2 + B 3 + B 6 , B 2 + B 4 + B 5 } . 
By the formula from Theorem 2 in the main text, 

(3) Succ Q( „)(M) = i+ (I) max F{B X , B 2 , B 3 ,B±,B 5 ,B 6 ). 
Let 

(4) m = max F(B U B 2 , B 3 , B 4 , B 5 , B 6 ). 

o<s 4 <i 

It suffices to prove that this maximum is achieved by some 6-tuple of the form 
(X, X- 1 , Y, Y- 1 , Z, Z- 1 ), where X, Y, and Z are projections. 

As noted in section [2] of the supplementary information, the radius function is 
convex in the sense that if (Hi, H 2 , H 3 , H4) and (H[, H 2 , H 3 , H' 4 ) are Hermitian 
operators and a £ [0, 1] is a real number, 

(5) Rad{aHi + (l-a)H<}i < aRM{Hi}i + (1 - a)Rad{JI-}i- 

It follows easily by linearity that a similar convexity property holds for F: for any 
Hermitian operators B\, . . . , Bq and B' l7 . . . , B' 6 , and any a £ [0, 1], 

F(aBx + (1 - a)B[, ...,aB 6 + (l- a)B' 6 ) 

< aF(Bi, . . . , B 6 ) + (1 - a)F(B[, B' 6 ). 

In particular, 

(6) F{aB x + (1 - a)B[, . . . , aB 6 + (1 - a)B' G ) 

< m^{F(B 1 ,...,B 6 ),F(B[,...,B' 6 )}. 

Additionally, F is translation-invariant in the following sense: for any Hermitian 
operators B\, . . . , Bq, and any Hermitian operators K, L, and M, 

(7) F{B 1 +K,B 2 +K,B 3 + L,B A + L,B 5 +M,B 6 +M) = F{B u ...,B e ). 
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Let Xi, X2, Yi, Y2, Z\, Z2 be Hermitian operators satisfying < X%,Yi,Zi < I 
such that F(X\ 1 X2, Yt, Y2, Z±, Z2) = rn. Let X+ and X- be a pair of positive 
semidefinite operators having mutual orthogonal supports which are such that 

(8) X 1 -X 2 =X+-X_. 
Define Y+,Y-, Z+, Z- similarly. By property above, 

(9) F{X + ,X-,Y+,Y-,Z+,Z-) = F(X 1 ,X 2 ,Y 1 ,Y 2 ,Z 1 ,Z 2 ) 

= m. 

The pair (X+ , X- ) can be expressed as a convex combination of pairs of pro- 
jections (Pj ^ , Pg ^ ) where for each i, the support of P^ is orthogonal to \ A 
similar decomposition exists for (Y+, Y-) and (2+, Z-). Therefore by property ([6|) 
above, there exist pairs of projections (Pi,P2), (Qi,Q2), (^1,^2), with each pair 
having mutually orthogonal supports, such that 

(10) F(P 1 ,P 2 ,Q 1 ,Q 2 ,Ri,R2) =m. 

Let P3 = I — Pi — P2 , and define Q3 and P3 similarly. By (J7J , 

(11) ^ (i^ + f + + f ,Q 2 + f ,«i + f + f 

The 6-tuple on the left hand side of the equation above is a convex combination of 
the 6-tuplcs 

(Pl+P 3 ,P 2 ,Ql + Q3,Q2,Pl+P3,P2) 

and (P 1; P 2 + P 3 , Qi, Q 2 + Q 3 , P x , P 2 + P 3 ) ■ 

By ([6]), at least one of these 6-tuples must achieve the maximum m. This completes 
the proof. □ 

Lemma 3. For any projection operators X , Y . Z on the two-dimensional vector 
space C 2 , the radius of the set 

(12) {X + Y + Z, X + Y 1 - + Z^-, X 1 - + Y + Z^-, X 1 - + Y 1 - + Z) . 
is less than or equal to 5 + -4=. 

Proof. Case 1: The matrices X, Y, and Z are all scalar matrices. In this 
case, each of X, Y, and Z is equal to either or I. This case is trivial, since the 
radius of the set {31,1} is 1, and the radius of the set {21,0} is 1. 

Case 2: Two of the matrices X, Y, Z are scalar matrices and one is a 
nonscalar. We may assume without loss of generality that X is the nonscalar 
matrix. Then the set (|12[) is equal to either 

(13) {0, A" + 1,21} 



(14) {X,X + 2I,1}. 

In the former case, the operator-norm distance from the operator I to the set 
{0, X + 1, 21} is 1. In the latter case, the operator-norm distance from the operator 
X + I to the set {X, X + 21, 1} is 1. The desired result follows. 



4 



SUPPLEMENTARY INFORMATION 



Case 3: Exactly one of the matrices X,Y,Z is a scalar matrix. We may 

assume that X and Y are nonscalar matrices and Z is scalar. Also, by replacing 
(X, Y, Z) with (X- 1 , Y, Z- 1 ) if necessary, we may assume that Z = I. 

Let X = \x) (x\ and Y = \y) (y\ where x,y € C 2 are unit vectors, and let 
9 = arccos (|a; ■ y\). Both of the operators 

(15) X + Y + I,X X +Y X + 1 

have eigenvalues {2 + cos 9, 2 — cos 9}, and both of the operators 

(16) X + Y ± ,X- L + Y 
have eigenvalues {1 + sin 6*, 1 — sin6>}. If wc let 



(17) 



cos ft — sm t 



I, 



then the operator norm distance from C to each of the elements of (fT2")) is \ 



cos fl+sin t 



< 



1 

V2- 



Case 4: Each of X, Y, Z is a nonscalar matrix. As in case 3, let X — \x) (x\ 

and Y = \y) (y\ and let 9 = arccos (\x • y\). 
Let 



(18) 



C = I+ - + 



cos 9 — sin 9 



Z 



sin 



Then, the operator norm of the difference 
(19) (X + Y + Z) -C= (X + Y) - 



cos — sin 9 



is i + cosfl + slng ; which is less than or equal to \ 



-k= . A similar calculation shows 

n/2 



that the distance from C to each of the other three elements of set (|12|) is equal to 



l + cose+sin^ Thig completes the proof 



□ 



For any angle 9 G M., let Pg : C 2 —> C 2 denote projection onto the unit vector 
cos(ft) |0) + sin(ft) |1). Consider the set 



(20) 



{PO + P*/4 +l,P + Psn/4, P-K/2 + P*/4,P*/2 + P S n/4 + l} 



A direct calculation shows that the distance from the operator (|j I to set (|20| is 
i + The next lemma asserts that this quantity is in fact the radius of (|2"0)) . 

Lemma 4. The radius of the set 

(21) {Po + P*/4 +l,P + Ptt/2 + Pn/4,Pn/2 + P^/A + l} 

is is equal to i + 

Proof. For any Hermitian operator H: C 2 —> C 2 , let us write _ff to denote the 
trace-zero operator H — (Tr(_ff))I/2. In the proof that follows, we will make use of 
the following fact: for any two Hermitian operators Q, R: C 2 — > C 2 , 



(22) 



\\Q-R\\ = |Tr(Q) - Tr(H)| + I |Q - R\ 
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Suppose, for the sake of contradiction, that there exists a Hermitian operator Z 
whose distance from each of the elements of set (f20|) is strictly less than \ + . 
Then, 

2 U + 4s) > \\( P 0+P3,/i)-Z\\ + \\(Pn/2+Pn/4)-Z\\ 



2 y/2 



= || (Po + Pa*/* - I) - Z\\ + || (P n/2 + P*/a - I) - Z\\ + 2 • |2 - Tr(Z)\ 

> || (Po + Pte/i) - (iV/2 + P,/i) || + 2 • |2 - Tr(Z)| 
= y/2 + 2- |2-Tr(Z)| 

Therefore, Tr(Z) < §. Similarly, 

2 G + ^) > IK P0+P -/4+I)-^|| + ||(^/2+^3V4)-l|| 

= || (PO + P-n/A - I) - ^|| + || (i^/a + - 1) - ^|| + 2 • |3 - Tr(Z)| 

> || (Po + *V«) - ( P -/2 + P 3./4) || + 2 • |3 - Tz{Z)\ 
= V2 + 2- |3-Tr(Z)|, 

which implies Tr(Z) > |. This is a contradiction. □ 

Combining Lemmas [SHll we have the following proposition. 
Proposition 5. The quantity Succq(2)(M) is equal to | + □ 

4. An example of optimal non-signaling assistance 

In this section we discuss an example in which equality occurs in Theorem 5 
from the main text. This example is a generalization of the protocol from [2]. 
Let to be a positive integer. Let 

(23) Z = F™, 

(24) W = (F™ \ {0}) x F 2 . 
Let K be a channel defined as follows: 

(1) The input alphabet of K is Z, and the output alphabet of K is W. 

(2) For any given input v G F™, the output of -KT is unformly distributed over 
the set 

(25) {(w, w v) | w e F™ \ {0}} . 

(Here, w • v G ¥2 denotes the inner product of w and v.) 

Let (Ei, E2) be a two part input-output device defined as follows. (See Figure[T]) 

(1) The input alphabet for Ei is F2, and the output alphabet for Ei is Z. 

(2) The input alphabet for E2 is W, and the output alphabet for E2 is F2. 

(3) If the inputs to E x and E 2 arc a G {0, 1} and (w, r) G (F^ \ {0}) x F 2 , then 
the output of Ei is unformly distributed over all vectors a = (ai, 02, . . . , a m ) 
that satisfy a x = a, and the output of E 2 is a © r © (w ■ a). 

It can be checked that the correlation E arising from (i?i,i? 2 ) is non-signaling. 
Additionally, one can see (by substitution) that using E to assist K yields a perfect 
transmission of a single bit. (See figure [2]) 

Now, let us calculate the quantity Succ (A"). For any two distinct vectors xo, xi G 
F™, the probability that a randomly chosen vector w G F™ s {0} will satisfy 
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w, r 



Ex 



T 

a = (a, 02, ■ ■ ■ , flm) 



afflrffi(w-a) 



Figure 1. The device (E 1 ,E 2 ) 



Alice 



Si 



(w, w • a) 



-> Bob 



Figure 2. A perfect communication protocol. 



w • xq 7^ w • xi is equal to 2 m_1 / (2 m — 1). This fact has the following consequence: 
if Alice employs the deterministic encoding strategy [0 i— > xo , 1 > xi] to send a 
single bit, then the optimal probability with which Bob can decode is 



(26) 



(27) 



2™ - 1 
2™ + 2™- 1 



(1) 



2 m - 1 



1 



2 m+l _ 2 • 

Therefore, Succ (K) is equal to quantity (|2"T1) . while Succns (-^0 is equal to 1. The- 
orem 5 from the main text asserts the following bound on Succns (K)' 



Succns (K) < - 



2-A 

2 m 



1 /2 m -l 
= - +2 

2 V 2™ 

= 1. 



Succ(X) - \ 



2m+l 



Therefore, equality is achieved in Theorem 5 from the main text when N = K . 

5. The local fraction of a binary quantum correlation 
In this section, we prove the following proposition from the main text. 
Proposition 6. Let D be a binary quantum correlation. Then loc(D) > 2 — \/2. 
Proof. For any binary non-signaling correlation G, let 

h{G)= (-l) x(SmaAv) G(xb\ay). 

a,x,b,y£{0,l} 

This is the function which defines the CHSH inequality [T]. let f 2 , /3, and f± be 
the functions defined by the same expression with a Ay replaced by —<a Ay, a A ->y, 
and -^a A ->y, respectively. 
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We note the following facts. (See [3].) 
(1) A non-signaling correlation G is local if and only if —2 < /j (G) < 2 for 



(3) There are eight non-signaling correlations {P i ; }f =1 and {P 4 }f =1 , satisfying 



These are the Popescu-Rohrlich (PR) boxes. 
(4) Every non-signaling correlation is a convex combination of local correlations 
and the eight PR boxes. Further, for any two distinct PR boxes P and P', 
the correlation (P + P')/2 is local. 
From the second part of item 0] it follows that any convex combination of local 
boxes and PR boxes can be simplified into an expression of the form aL + (1 — 
where L is local, Q is a PR box, and a € [0, 1]. Any non-signaling correlation can 
thus be expressed as a convex combination of a local correlation and a single PR 
box. 

Let D — aL + (1 — a)Q, where L is local and Q is a PR box. First suppose 
that Q = Pf. Let L fJ = (aL + (j8 - a)P+)//3. for any /3 G [a, 1]. Then L p is local 
whenever fj(Lp) < 2. If /j (Li) < 2, then Li(= -D) is local, and the proposition 
follows easily. Otherwise, there is a value (3 g [a, 1] such that fj (Lp) = 2. We have 
D = P ■ Lp + (1 - . The quantity (3 must be at least 2 - a/2, since otherwise 
© would be violated. Therefore loc(D) > 2 - \/2. 

A similar argument completes the proof in the case where Q = PJ . □ 
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(2) 



i = 1,2,3,4. 

If G is a quantum correlation, then for i = 1, 2, 3, 4, 



-2V2 < fi (G) < 2V2. 




otherwise 



2011. 



